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Applicant(s) 
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Examiner 
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Art Unit 

1634 



-- The MAILING DATE of this communication appears on the cover sheet with the correspondence address - 
Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133). 

- Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1 .704(b). 

Status 

1 )IEl Responsive to communication(s) filed on 18 September 2002 . 
2a)D This action is FINAL. 2b)H] This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 11 , 453 O.G. 213. 
Disposition of Claims 

4) Ex] Claim(s) 1-16 is/are pending in the application. 

4a) Of the above claim(s) 2.5-7,9,10,12,13 and 15 is/are withdrawn from consideration. 

5) D Claim(s) is/are allowed. 

6) [X] Claim(s) 1.3.4.8,11 and 14 is/are rejected. 

7) D Claim(s) is/are objected to. 

8) D Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) D The specification is objected to by the Examiner. 

10) D The drawing(s) filed on is/are: a)D accepted or b)D objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1 .85(a). 

11) D The proposed drawing correction filed on is: a)D approved b)D disapproved by the Examiner. 

If approved, corrected drawings are required in reply to this Office action. 

12) D The oath or declaration is objected to by the Examiner. 
Priority under 35 U.S.C. §§119 and 120 

13) [X] Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 1 19(a)-(d) or (f). 

a)DAII b)D Some*c)D None of: 

1 .□ Certified copies of the priority documents have been received. 

2.Q Certified copies of the priority documents have been received in Application No. . 



3.Q Copies of the certified copies of the priority documents have been received in this National Stage 
application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 



14)E 



Acknowledgment is made of a claim for domestic priority under 35 U.S.C. § 1 1 9(e) (to a provisional application). 

a) □ The translation of the foreign language provisional application has been received. 
15)D Acknowledgment is made of a claim for domestic priority under 35 U.S.C. §§ 120 and/or 121. 
Attachment(s) 

1 ) Kl Notice of References Cited (PTO-892) 4) □ Interview Summary (PTO-41 3) Paper No(s). 



2) O Notice of Draftsperson's Patent Drawing Review (PTO-948) 

3) Information Disclosure Statement(s) (PTO-1449) Paper Nofs) 4 and 7 . 



5) □ Notice of Informal Patent Application (PTO-152) 

6) Q Other: 



U.S. Patent and Trademark Office 
PTO-326 (Rev. 04-01) 



Office Action Summary 



Part of Paper No. 1 2 
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Art Unit: 1634 
Detailed Action 

1 . Applicant's election of Group I (Claims 1 , 3-4, 8 1 1 and 14 & SEQ ID NO: 365) in the response 

filed 18 SEP 02 (i.e. paper Nos. 9-11) is acknowledged. Accordingly, Claims 2, 5-7, 9-10, 12-13 are 
withdrawn from further consideration by the examiner, 37 CFR 1.142(b), as being drawn to a non-elected 
invention. It is noted that the applicant did not distinctly and specifically point out any supposed errors in 
the restriction requirement, therefore the election has been treated as an election without traverse (MPEP 
§ 818.03(a)). The restriction requirement has been reconsidered, is deemed proper and is therefore, 
herein made FINAL. An action on Claim(s) 1, 3-4, 8, 11 and 14 as they relate to SEQ ID NO: 365 
follows. 



Sequence Rules 



2. 



This application complies with the sequence rules and the sequences have been entered by the 



Scientific and Technical Information Center. 



35 USC § 101 



3. 



35 U.S.C. § 101 reads as follows: 



"Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter or any 
new and useful improvement thereof, may obtain a patent therefore, subject to the conditions and requirements of 
this title". 
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Claim Rejections - 35 USC § 101 

4. Claim(s) 1,3-4, 8 11 and 14 is/are rejected under 35 U.S.C. § 101 because the claimed 
invention is not supported by either a specific and substantial asserted utility or a well established utility. 

The claimed cDNA compound is not supported by a specific asserted utility because the 
use of SEQ ID NO: 365 or a fragment thereof is generally applicable to any nucleic acid and therefore is 
not particular to the nucleic acid being claimed. Further, the claimed cDNA compound is not supported by 
a substantial utility because the specification only supports the use of the cDNA for making the 
corresponding protein. In this case the protein set forth in SEQ ID NO: 366. Once the protein is obtained, 
the protein would be used in research to functionally characterize the protein. A starting material that can 
only be used to produce a final product does not have a substantial asserted utility in those instances 
where the final product is not supported by a specific and substantial utility. In the instant case the 
protein that is to be produced as a final product resulting from processes involving the claimed cDNA 
does not have an asserted or identified specific and substantial utility. The research contemplated by 
applicants to characterize the potential protein product, especially their biological activities, does not 
constitute a specific and substantial utility. Identifying and studying the properties of the protein itself or 
the mechanisms in which the protein is involved does not define a "real world" context of use. Note, 
because the claimed invention is not supported by a specific and substantial asserted utility for the 
reasons set forth above, credibility has not been assessed. Neither the specification as filed nor any art of 
record discloses or suggests any property or activity for the cDNA compound such that another non- 
asserted utility would be well established for the compound 

35 USC § 112 - 1st Paragraph 

5- The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of making and 
using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it pertains, or 
with which it is most nearly connected, to make and use the same and shall set forth the best mode contemplated by 
the inventor of carrying out his invention. 
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Claim Rejections under 35 USC § 112- 1st Paragraph 
6- Claim(s) 1, 3-4, 8,11 and 14 is/are rejected under 35 U.S.C. 1 12, first paragraph. 

Specifically, since the claimed invention is not supported by either a specific and 
substantial asserted utility or a well established utility for the reason(s) set forth above, one skilled in the 
art would not know how to use the claimed invention without undue experimentation. 



35 USC § 1 1 2- 2nd Paragraph 

7„ The following is a quotation of the second paragraph of 35 U.S.C. 1 1 2: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 



Claim Rejections under 35 USC § 112- 2nd Paragraph 

8. Claim(s) 1, 3-4, 8, 11 and 14 is/are rejected under 35 U.S.C. 112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which applicant regards 
as the invention. 

Claim(s) 1, 3-4, 8, 11 and 14 recite limitations that have been withdrawn from 
consideration as the result of the restriction/election requirement. Please correct. 

Claim(s) 1 is/are indefinite in that in the preamble the claim recites a "polynucleotide 
comprising" however in section (c) the claim recites "sequences consisting of" These are contradictory in 
that one (i.e. a "polynucleotide comprising") is open language while the other (i.e. "sequences consisting 
of) is closed language. As a result the scope of the claimed invention cannot be determined. Please 
clarify. 
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35 USC §102 



9- The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that may form the 
basis for rejections set forth in this Office action: 

A person shall be entitled to a patent unless -- 

(a) the invention was known or used by others in this country, or patented or described in a printed publication 
in this or a foreign country, before the invention thereof by the applicant for a patent. 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or 
on sale in this country, more than one year prior to the date of application for patent in the United States. 

(e) The invention was described in - 

(1) an application for patent, published under section 122(b), by another filed in the United States 
before the invention by the applicant for patent, except that an international application filed under the 
treaty defined in section 351(a) shall have the effect under this subsection of a national application 
published under section 122(b) only if the international application designating the United States was 
published under Article 21 (2)(a) of such treaty in the English language; or 

(2) a patent granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that a patent shall not be deemed filed in the United States for the purposes of this 
subsection based on the filing of an international application filed under the treaty defined in section 351(a) 



Claim Rejections under 35 USC § 102 
1 0. Claim(s) 1 is/are rejected under 35 U.S.C. 1 02(a) as being anticipated by Kikuno et al. (1 999). 

Claim 1 is drawn to an isolated polynucleotide comprising a sequence selected from a 
defined group which includes a polynucleotide sequence provided in SEQ ID NO : 365 or the 
complement of SEQ ID NO: 365 or a sequence that will hybridize to SEQ ID NO: 365 under moderately 
stringent conditions or a sequence consisting of at least 20 contiguous residues of a sequence provided 
in SEQ ID NO: 365 or a sequence which has at least 75% identity with SEQ ID NO: 365 or a sequence 
which has at least 90% identity with SEQ ID NO: 365 or a degenerate variant of SEQ ID NO: 365. 

Kikuno et al. teach an isolated polynucleotide comprising (i.e. provided in) SEQ ID NO : 365. See 
the attached marked Kikuno et al./ SEQ ID NO: 365. Also note the attached marked Emerson et al./ SEQ 
ID NO: 365. Emerson et al. is not prior art, but it could become prior art if a corresponding US patent 
issues which claims priority to the US provisional application cited on the front of WO0121640 A1. The 
sequence taught by Kikuno et al. consists of at least 20 contiguous residues of a sequence provided in 
SEQ ID NO: 365. Note that because Kikuno et al. teach SEQ ID NO: 365 they necessarily (i.e. 
inherently) teach the complement of SEQ ID NO: 365 ( i.e. a sequence that will hybridize to SEQ ID NO: 
365 under moderately stringent conditions) 
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Also, note that the sequence of Kikuno et al. has a sequence which has at least 75% identity with 
SEQ ID NO: 365 and has at least 90% identity with SEQ ID NO: 365. Finally, note that Kikuno et al. teach 
one variant (i.e. degenerate variant) of SEQ ID NO: 365. 

1 1 - Claim(s) 1 and 8 is/are rejected under 35 U.S.C. 102(b) as being anticipated by Sommer et al. 
(1989). 

Claim 1 is drawn to an isolated polynucleotide comprising a sequence selected from a 
defined group which includes a polynucleotide sequence that will hybridize with SEQ ID NO: 365 under 
moderately stringent conditions. 

Claim 8 is drawn to an oligonucleotide that hybridizes to SEQ ID NO: 365 under 
moderately stringent conditions. 

Sornmer et al. teach an isolated polynucleotide (i.e. the first oligo listed in Table 1) which will 
hybridize with SEQ ID NO: 365 under moderately stringent conditions. Admittedly, the oligo taught by 
Sommer et al. will not hybridize throughout it entire length with SEQ ID NO: 365 but it will hybridize 
enough to prime amplification under moderately stringent conditions, therefore it meets all of the 
limitations of Claim 1 part (d) and all of the limitations of Claim 8. See for example, the sequence of SEQ 
ID NO: 365 at nucleotide 1359-1361 which reads TAG. This is the inverse complement of the 3* end of 
the first oligo listed in Table 1 . Note that Sommer teaches the minimal homology requirements for PCR 
primers (i.e. 17-20mers need at least 3 complementary nucleotides at their 3' ends for successful 
priming). One possible way to overcome this rejection is to amend the claim(s) to make it clear that the 
sequences claimed in Claim 1, step(d), and Claim 8 hybridize throughout their entire lengths to SEQ ID 
NO: 365. 

35 USC §103 

1 2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all obviousness 
rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in section 102 
of this title, if the differences between the subject matter sought to be patented and the prior art are such that the subject 
matter as a whole would have been obvious at the time the invention was made to a person having ordinary skill in the art 
to which said subject matter pertains. Patentability shall not be negatived by the manner in which the invention was 
made. 
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13. This application currently names joint inventors. In considering patentability of the claims under 
35 U.S.C. § 103, the examiner presumes that the subject matter of the various claims was commonly 
owned at the time any inventions covered therein were made absent any evidence to the contrary. 
Applicant is advised of the obligations under 37 CFR 1 .56 to point out the inventor and invention dates of 
each claim that was not commonly owned at the time a later invention was made in order for the 
examiner to consider the applicability of potential 35 U.S.C. § 1 02(f) or (g) prior art under 35 U.S.C. 
§103. 

Claim Rejections under 35 USC § 103 

14. Claim(s) 14 is/are rejected under 35 U.S.C. 102(b) as being anticipated by Sommer et al. (1989) 
as applied against Claim 8 above and further in view of the Stratagene Catalog (1988). 

Claim 14 is drawn to a kit comprising at least one oligo according to Claim 8. 
Sommer et al. teaches an oligo which meets all of the limitations of Claim 8. Sommer et al. does 
not teach a kit. However, as evidenced by the Stratagene Catalog teaching, it was well known at the time 
of the invention to place the reagents needed to perform a nucleic acid based assay into a kit format. 
Therefore, absent an unexpected result, it would have been prima facie obvious to the ordinary artisan at 
the time of the invention to modify the teachings of Sommer et al. with the teachings of the Stratagene 
Catalog wherein the reagents necessary to perform the method suggested by Sommer et al. are placed 
into a kit format. The ordinary artisan would have been motivated to make this modification in order to 
take advantage of the savings and efficiency afforded by kits. 

Conclusion 

15. Ciaim(s) 1, 3-4, 8, 11 and 14 is/are rejected and/or objected to for the reason(s) set forth above. 

1 6. Any inquiry concerning this communication or earlier communications from the examiner should 
be directed to Ethan Whisenant, Ph.D. whose telephone number is (703) 308-6567. The examiner can 
normally be reached Monday-Friday from 8:30AM -5:30PM EST or any time via voice mail. If repeated 
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attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, W. Gary 
Jones, can be reached at (703) 308-1 152. 

The fax number for this Examiner is (703) 746-8465. Before faxing any papers please inform the 
examiner to avoid lost papers. Please note that the faxing of papers must conform with the Notice to 
Comply published in the Official Gazette, 1096 OG 30 (November 15, 1989). Any inquiry of a general 
nature or relating to the status of this application should be directed to the group receptionist whose 
telephone number is (703) 308-0196. 




Ethan Whisenant, Ph.D. 
Primary Examiner 



Kikuno et al./SEQ ID NO: 3 65 

RESULT 3 
AB029000 

LOCUS AB029000 4834 bp mRNA linear PRI 04-AUG-1999 

DEFINITION Homo sapiens mRNA for KIAA1077 protein, partial cds . 
ACCESSION AB02 900 0 

VERSION AB029000.1 GI: 5689490 

KEYWORDS 

SOURCE Homo sapiens brain cDNA to mRNA, clone_lib : pBluescriptll SK plus 

clone :hj 06803 . 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (sites) 

AUTHORS Kikuno, R., Nagase,T., Ishikawa,K., Hirosawa,M., Miyajima,N., 

Tanaka,A., Kotani,H., Nomura, N. and Ohara,0. 
TITLE Prediction of the coding sequences of unidentified human genes. 

XIV. The complete sequences of 100 new cDNA clones from brain which 

code for large proteins in vitro 
JOURNAL DNA Res. 6 (3), 197-205 (1999) 
MEDLINE 99397452 
REFERENCE 2 (bases 1 to 4834) 

AUTHORS Ohara , 0 . , Nagase , T . and Kikuno , R . 
TITLE Direct Submission 

JOURNAL Submitted (17-JUN-1999) Osamu Ohara, Kazusa DNA Research Institute, 
Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 
292 -0812 , Japan (E-mail : cdnainf o@kazusa . or . jp, Tel : +81-438-52 -3913, 
Fax:+81-438-52-3914) 
FEATURES Location/Qualifiers 
source 1 . .4834 

/organism= "Homo sapiens" 
/db_xref ="taxon: 96 06" 
/chromosome^" 8" 
/clone="hj06803" 
/ tissue_type= "brain" 
/clone_lib= "pBluescriptll SK plus" 
gene 1. .2457 

/gene="KIAA10 77" 
CDS <1. .2457 

/ gene= " KIAA1 0 77" 
/ codon_s tart = 1 
/product="KIAA1077 protein" 
/protein__id="BAA83 02 9 .1" 
/db_xref ="GI : 5689491" 

/ trans 1 at ion= "DVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTG 
KYVHNHNVYTNNENCS SPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGS YI PPG 
WREWLGL I KNSRF YNYTVCRNGI KEKHGFDYAKDYFTDL I TNE S INYFKMS KRMYPHR 
PVMMV I S HAAPHG PED S AP Q F S KL YPNAS QH I TP S YN YAPNMD KHW I MQ YTGPML P I H 
MEFTNI LQRKRLQTLMSVDDS VERLYNMLVETGELENTYI I YTADHGYHIGQFGLVKG 
KSMPYDFDIRVPFFIRGPSVEPGSIVPQIVLNIDLAPTILDIAGLDTPPDVDGKSVLK 
LLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELC 
QQARYQTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKE 
CSCRESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQTRSLSVEFEGEIYDINLEE 
EEELQVLQPRNIAKRHDEGHKGPRDLQASSGGNRGRMLADSSNAVGPPTTVRVTHKCF 
ILPNDSIHCERELYQSARAWKDHKAYIDKEIEALQDKIKNLREVRGHLKRRKPEECSC 
SKQSYYNKEKGVKKQEKLKSHLHPFKEAAQEVDSKLQLFKENNRRRKKERKEKRRQRK 
GEECSLPGLTCFTHDNNHWQTAPFWNLGSFCACTSSNNNTYWCLRTVNETHNFLFCEF 
ATGFLEYFDMNTDPYQLTNTVHTVERGILNQLHVQLMELRSCQGYKQCNPRPKNLDVG 
NKDGGS YDLHRGQLWDGWEG " 
BASE COUNT 1498 a 992 c 1042 g 13 02 t 



Query Match 100.0%; Score 4834; DB 9; Length 4834; 

Best Local Similarity 100.0%; Pred . No. 0; 

Matches 4834; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGG 6 0 

MMIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMM 

Db 1 GATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGG 6 0 

Qy 61 GGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCC 12 0 

MIIMIIIIIIMIIMIIIIIIIIIIIMIIIIMIIIMIIIMIIIMIMIIIII 

Db 61 GGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCC 12 0 

Qy 121 ATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCT 180 

IMIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIII 

Db 121 ATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCT 18 0 

Qy 181 TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACT 24 0 

IMIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIM 

Db 181 TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACT 24 0 

Qy 241 GGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCC 30 0 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIMIIIIIMIIIIMI 

Db 241 GGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCC 3 00 

Qy 301 CCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTT 3 60 

IIIIIIIIIMIIIIIIIIIIIMMIIIIIIIMIIIIMMIIIIIIMIIIIIIIII 

Db 301 CCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTT 36 0 

Qy 361 TGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGAC 42 0 

IIMIIIMIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIII 

Db 3 61 TGTCGCAATGGCATGAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGAC 42 0 

Qy 421 TTAATCACTAACGAGAGCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCCCATAGG 4 80 

IMIMIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIMMIi 

Db 421 TTAATCACTAACGAGAGCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCCCATAGG 48 0 

Qy 481 CCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG 54 0 

I I I I I M I I I I I I I I I I I I I I I I I | I I | M I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | 
Db 481 CCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG 54 0 

Qy 541 TTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCA 60 0 

MIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIMIIIII 

Db 541 TTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCA 6 00 

Qy 601 AATATGGATAAACACTGGATTATGCAGTACACAGGACCAATGCTGCCCATCCACATGGAA 66 0 

MIIIIIIIIIIIIIIMIIIIIIIMIIIIIMIIIIIIMIIIIIMMMIIIIIII 

Db 601 AATATGGATAAACACTGGATTATGCAGTACAGAGGACCAATGCTGCCCATCCACATGGAA 660 

Qy 661 TTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTG 72 0 

IIIIIIIIIIMIIIIIIIIMMIIIIIIIIIIIIIIIMIIIMIIIIMIIIIIIII 

Db 661 TTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTG 72 0 

Qy 721 GAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATAGTTACATCATTTAC 78 0 

IIIMIIIIIMIIIIIIIIIIIIIIIMIIMIIMIIIIIIMIIIIIIIIIIIIIII 

Db 721 GAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTAC 78 0 

Qy 781 ACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGGGAAATCCATGCCA 840 

I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I | I I | | | | | I I I I I | | | | | 
Db 7 81 ACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGGGAAATCCATGCCA 84 0 



Qy 841 TATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGTAGAACCAGGATCA 900 

^ a MM 

Db 841 TATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGTAGAACCAGGATCA 90 0 

Qy 901 ATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCTGGG 960 

^ MMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIMIIMIMIIMIMI 

Db 901 ATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCTGGG 960 

Qy 961 CTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCTGGACCCAGAAAAG 102 0 

_ 1 1 M MM I II I II II I II II II IN 1 1 Ml MM 1 1| || Ml II I II II II I II I II II 

Db 9 61 CTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCTGGACCCAGAAAAG 102 0 

Qy 1021 CCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAGTG 108 0 

I I M I I I II II I II I I I I I I I II I | | | | | M || | | | | | | || | | | | | | | | | | | | | | M I I I 
Db 1021 CCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAGTG 108 0 

Qy 10 81 GAAAGAGGCAAATTTCTACGTAAGAAGGAAGAAT CCAG CAAGAATAT C CAAC AGT CAAAT 114 0 

I M 1 1 1 1 1 f 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 M ]! if I M I 1 1 

Db 10 81 GAAAGAGGCAAATTTC TACGTAAGAAGGAAGAAT C CAGCAAGAAT AT C CAAC AGT CAAAT 114 0 
Qy 1141 CACTTGCC CAAAT ATGAACGGGTCAAAGAACTATGCCAGCAGGCCAGGTACCAGACAGCC 12 00 

~ 1 1 M M I II M II 1 1 M M II II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M I 1 1 1 1 I II 1 1 1 II 1 1 1 1 1 1 1 

Db 1141 CACTTGCCCAAATATGAACGGGTCAAAGAACTATGCCAGCAGGCCAGGTACCAGACAGCC 1200 
Qy 12 01 TGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTGGCAAGCTTCGAATT 12 6 0 

I M 1 1 1 1 II II 1 1 1 II I I II I I M 1 1 1 1 1 1 1 1 1 II I II M I 1 1 1 II 1 1 1 1 I II 1 1 1 1 1 1 1 

Db 12 01 TGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTGGCAAGCTTCGAATT 12 60 

Qy 12 61 CACAAGTGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTAC 13 2 0 

I I I M I II I II I I I I I I I I I I I I | | | | | | | | | | | | | | || | | | || | | | | | M I I I I I I I I | 

Db 12 61 CACAAGTGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTAC 13 2 0 

Qy 13 21 GCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGTTGTAGGGAGTCTGGTTACCGTGCC 13 8 0 

nK 100 M MM I II MM I II I II II I II II II 1 1| I || || M Ml II III Mill II III II II 

Db 13 21 GCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGTTGTAGGGAGTCTGGTTACCGTGCC 13 80 

Qy 13 81 AGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGGGACTCCAAAGTAC 144 0 

I I M I II I I I I I I I I II I I | || I I I I I II I I I I | | | | | | | | | | | || I II M I I I I I I I II 

Db 13 81 AGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGGGACTCCAAAGTAC 144 0 

Qy 1441 AAGCCCAGATTTGTCCATACTCGGCAGACACGTTCCTTGTCCGTCGAATTTGAAGGTGAA 15 00 

I M II I I I II I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | M II I I II II II M I I I I I 

Db 1441 AAGCCCAGATTTGTCCATACTCGGCAGACACGTTCCTTGTCCGTCGAATTTGAAGGTGAA 1500 

Qy 15 01 ATATATGACATAAATCTGGAAGAAGAAGAAGAATTGCAAGTGTTGCAACCAAGAAACATT 15 60 

i I M II II M I I I I I II I I I M II I I I II I I I I I I I I | | | | M I I I I I I I I I I M I I I I | 
Db 15 01 ATATATGAC ATAAAT CTGGAAGAAGAAGAAGAATTGCAAGTGTTGCAAC CAAGAAACATT 15 60 

Qy 15 61 GCTAAGCGTCATGATGAAGGCCACAAGGGGCCAAGAGATCTCCAGGCTTCCAGTGGTGGC 162 0 

II I MM III II Mill Mill II III II II I MM III II I MM Mill II Ml II II 

Db 15 61 GCTAAGCGTCATGATGAAGGCCACAAGGGGCCAAGAGATCTCCAGGCTTCCAGTGGTGGC 162 0 
Qy 1621 AACAGGGGCAGGATGCTGGCAGATAGCAGCAACGCCGTGGGCCCACCTACCACTGTCCGA 16 80 

II I MM III II I Mill 1 1 II II III II III II || 1 1| || | MM Ml M Mill II 1 1 

Db 1621 AACAGGGGCAGGATGCTGGCAGATAGCAGCAACGCCGTGGGCCCACCTACCACTGTCCGA 16 80 

Qy 1681 GTGACACACAAGTGTTTTATTCTTCCCAATGACTCTATCCATTGTGAGAGAGAACTGTAC 174 0 

1 1 I 1 1 M I II I I I I I I I II I I I I I I II II I M I II I I I I II I I II I I I I I I I I | I I M I I 
Db 1681 GTGACACACAAGTGTTTTATTCTTCCCAATGACTCTATCCATTGTGAGAGAGAACTGTAC 174 0 



Qy 1741 CAATCGGCCAGAGCGTGGAAGGACCATAAGGCATACATTGACAAAGAGATTGAAGCTCTG 18 0 0 

MIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIMIIIIIIIIMII 

Db 1741 CAAT CGGC CAGAG CGTGGAAGGACCAT AAGGCATACATTGACAAAGAGATTGAAGC T CTG 18 0 0 
Qy 1801 CAAGATAAAATTAAGAATTTAAGAGAAGTGAGAGGACAT CTGAAGAGAAGGAAGC C TGAG I86 0 

IMIIIIIIIIIIIIIIIIIIIIMIIMMIMIIIMIIIIIIIIIIIMllllllll 

Db 1801 CAAGATAAAATTAAGAATTTAAGAGAAGTGAGAGGACATCTGAAGAGAAGGAAGCCTGAG 18 6 0 

Qy 18 61 GAATGTAGCTGCAGTAAACAAAGCTATTACAATAAAGAGAAAGGTGTAAAAAAGCAAGAG 192 0 

I M I I I I I I I I I II I I I I I I | | | | | | | | | | | | | | M I I I I I I I I M I I I I I I I | I I M II 
Db 1861 GAATGTAGCTGCAGTAAACAAAGCTATTACAATAAAGAGAAAGGTGTAAAAAAGCAAGAG 192 0 

Qy 1921 AAATTAAAGAGC CATCT T C AC C CAT TCAAGGAGGCTG C T CAGGAAGTAGAT AGCAAAC TG 1980 

IIMIMIIIMIIIIIIIIMIIIIIIIIIIIIIMIIIIIMIMIIIIIIIIIIIM 

Db 1921 AAATTAAAGAGCCATCTTCACCCATTCAAGGAGGCTGCTCAGGAAGTAGATAGCAAACTG 198 0 

Qy 1981 CAACTTTT CAAGGAGAACAAC CGTAGGAGGAAGAAGGAGAGGAAGGAGAAGAGACGGCAG 2040 

I M I I I I M I I M M I I I I I I I I I I I I I || || || | | | | | | | I | | | | | | | | | | | | | | | || | 
Db 1981 CAACTTTTCAAGGAGAACAACCGTAGGAGGAAGAAGGAGAGGAAGGAGAAGAGACGGCAG 2 04 0 

Qy 2 041 AGGAAGGGGGAAGAGTGCAGCCTGCCTGGCCTCACTTGCTTCACGCATGACAACAACCAC 210 0 

MMMMMMIIIIMMMIMIIMMIMIIIIIIMMMIMIMMMIMI 

Db 2 041 AGGAAGGGGGAAGAGTGCAGCCTGCCTGGCCTCACTTGCTTCACGCATGACAACAACCAC 210 0 
Qy 2101 TGGCAGACAGCCCCGTTCTGGAACCTGGGATCTTTCTGTGCTTGCACGAGTTCTAACAAT 216 0 

I MM II I Mill II III MM I II II I MM Ml II 1 1 III II Ml II III II II I II I 

Db 2101 TGGCAGACAGCCCCGTTCTGGAACCTGGGATCTTTCTGTGCTTGCACGAGTTCTAACAAT 216 0 
Qy 2161 AACACCTACTGGTGTTTGCGTACAGTTAATGAGACGCATAATTTTCTTTTCTGTGAGTTT 2 22 0 

nK _ Ml Mill Mill II III II II I II III MM III II II III II III II Ml || || | || | 

Db 2161 AACACCTACTGGTGTTTGCGTACAGTTAATGAGACGCATAATTTTCTTTTCTGTGAGTTT 222 0 
Qy 2 221 GCTACTGGCTTTTTGGAGTATTTTGATATGAATACAGATCCTTATCAGCTCACAAATACA 22 8 0 

M I M 1 1 1 I M II 1 1 II I M 1 1 1 II 1 1 1 I II I M I 1 1 1 1 1 1 1 II M I 1 1 1 1 1 M M I II I 

Db 2 221 GCTACTGGCTTTTTGGAGTATTTTGATATGAATACAGATCCTTATCAGCTCACAAATACA 22 8 0 
Qy 2281 GTGCACACGGTAGAACGAGGCATTTTGAATCAGCTACACGTACAACTAATGGAGCTCAGA 2 34 0 

I 1 1 I f f 1 1 1 1 1 1 1 1 1 1 ! I! I 1 1 ! 1 1 1 1 1 1 1 1 1 I [ I 1 1 1 1 1 1 1 1 1 1 1 1 ! | 1 1 1 1 1 1 1 M I I 

Db 2281 GTGCACACGGTAGAACGAGGCATTTTGAATCAGCTACACGTACAACTAATGGAGCTCAGA 2 34 0 
Qy 2 341 AGCTGTCAAGGATATAAGCAGTGCAACCCAAGACCTAAGAATCTTGATGTTGGAAATAAA 24 0 0 

Ml II I II Mill Mill 1 1 III II 1 1 Ml 1 1 III II III MM I II II Ml II I II III 

Db 2 341 AGCTGTCAAGGATATAAGCAGTGCAACCCAAGACCTAAGAATCTTGATGTTGGAAATAAA 2 4 0 0 
Qy 2 401 GATGGAGGAAGCT ATGAC CTACACAGAGGACAGTTATGGGATGGATGGGAAGGTTAAT CA 246 0 

I M M I II II I II II M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M | II I II I 1 1 I 1 1 II M I I M II 1 1 1 

Db 2401 GATGGAGGAAGCTATGACCTACACAGAGGACAGTTATGGGATGGATGGGAAGGTTAATCA 246 0 

Qy 2461 GCCCCGTCTCACTGCAGACATCAACTGGCAAGGCCTAGAGGAGCTACACAGTGTGAATGA 2 52 0 

I 1 1 I M I I I I I II I I II I I I I II I I II I I I I || II I I I I II I I I I I I | I | | M II II II I 
Db 2461 GCCCCGTCTCACTGCAGACATCAACTGGCAAGGCCTAGAGGAGCTACACAGTGTGAATGA 2 52 0 

Qy 2 521 AAACATCTATGAGTACAGACAAAACTACAGACTTAGTCTGGTGGACTGGACTAATTACTT 2 58 0 

1 1 1 1 1 M I I II II II II II I M I II M I 1 1 1 M 1 1 1 1 1 1 1 1 1 II I 1 1 I II 1 1 1 1 1 1 1 | M 

Db 2 521 AAACATCTATGAGTACAGACAAAACTACAGACTTAGTCTGGTGGACTGGACTAATTACTT 2 58 0 
Qy 2 581 GAAGGATTTAGATAGAGTATTTGCACTGCTGAAGAGTCACTATGAGCAAAATAAAACAAA 2 64 0 

Ml Mill I II Ml II II 1 1 Ml Mill II III II II I II II II I II Ml II Mill Ml 

Db 2 581 GAAGGATTTAGATAGAGTATTTGCACTGCTGAAGAGTCACTATGAGCAAAATAAAACAAA 2 64 0 



Qy 2 641 TAAGACTCAAACTGCTCAAAGTGACGGGTTCTTGGTTGTCTCTGCTGAGCACGCTGTGTC 2 70 0 

_ _ MM 1 1 MM I III Mill II II I II II III II 1 1 III II IN II Ml II || | MM | || 

Db 2 641 TAAGACTCAAACTGCTCAAAGTGACGGGTTCTTGGTTGTCTCTGCTGAGCACGCTGTGTC 2 70 0 
Qy 2 701 AATGGAGATGGCCTCTGCTGACTCAGATGAAGACCCAAGGCATAAGGTTGGGAAAACACC 2 760 

MIIMIIIIIIIIIIIIMIIIIIIMIIIIMMIMIIMIIIIIIIIMIIIIIII 

Db 2 701 AATGGAGATGGCCTCTGCTGACTCAGATGAAGACCCAAGGCATAAGGTTGGGAAAACACC 2 760 

Qy 2 761 TCATTTGACCTTGCCAGCTGACCTTCAAACCCTGCATTTGAACCGACCAACATTAAGTCC 2 82 0 

nK _ I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | 

Db 2761 TCATTTGACCTTGCCAGCTGACCTTCAAACCCTGCATTTGAACCGACCAACATTAAGTCC 282 0 

Qy 2 821 AGAGAGTAAACTTGAATGGAATAACGACATTCCAGAAGTTAATCATTTGAATTCTGAACA 28 8 0 

, 0 1 1 1 1 I 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I | I 1 1 1 1 M I 1 1 I 1 1 1 1 I 1 1 I I 1 1 I 1 1 

Db 2 821 AGAGAGTAAACTTGAATGGAATAACGACATTCCAGAAGTTAATCATTTGAATTCTGAACA 28 8 0 

Qy 2 881 CTGGAGAAAAACCGAAAAATGGACGGGGCATGAAGAGACTAATCATCTGGAAACCGATTT 2 94 0 

M i 1 1 M I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | M I II I I I I M I I I | 
Db 2 881 CTGGAGAAAAAC CGAAAAATGGACGGGGCATGAAGAGACTAAT CAT CTGGAAAC CGATTT 2 94 0 

Qy 2 941 CAGTGGCGATGGCATGACAGAGCTAGAGCTCGGGCCCAGCCCCAGGCTGCAGCCCATTCG 3 0 00 

_ OQ M Ml MM I MINI III II I II II II I II MM III II I II 1 1 Ml III II 1 1 II I II 

Db 2 941 CAGTGGCGATGGCATGAGAGAGCTAGAGCTCGGGCCCAGCCCCAGGCTGCAGCCCATTCG 3 0 00 
Qy 3 001 CAGGCACCCGAAAGAACTTCCCCAGTATGGTGGTCCTGGAAAGGACATTTTTGAAGATCA 3 0 60 

nK Qnni M Ml IMM I III 1 1 III II MM I II Ml Ml MM 1 1 MM MM I Mill I III 1 1 

Db 3 0 01 CAGGCACCCGAAAGAACTTCCCCAGTATGGTGGTCCTGGAAAGGACATTTTTGAAGATCA 3 060 

Qy 3 061 ACTATATCTTCCTGTGCATTCCGATGGAATTTCAGTTCATCAGATGTTCACCATGGCCAC 312 0 

nK , M M M M I II II I I I II I II M || || || | || || | || M M II I I II I II || II I II I II 

Db 3 0 61 ACTATATCTTCCTGTGCATTCCGATGGAATTTCAGTTCATCAGATGTTCACCATGGCCAC 312 0 

Qy 3121 CGCAGAACACCGAAGTAATTCCAGCATAGCGGGGAAGATGTTGACCAAGGTGGAGAAGAA 3180 

1 1 1 1 ! 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I ! 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 M I 1 1 1 1 1 1 1 1 1 MIMM 

Db 3121 CGCAGAACACCGAAGTAATTCCAGCATAGCGGGGAAGATGTTGACCAAGGTGGAGAAGAA 3180 
Qy 3181 TCACGAAAAGGAGAAGTCACAGCACCTAGAAGGCAGCGCCTCCTCTTCACTCTCCTCTGA 32 4 0 

MM I M I Ml Ml I MM III Ml II MIMM MM III II I MM Mill Mill II 

Db 3181 TCACGAAAAGGAGAAGTCACAGCACCTAGAAGGCAGCGCCTCCTCTTCACTCTCCTCTGA 324 0 

Qy 32 41 TTAGATGAAACTGTTACCTTACCCTAAACACAGTATTTCTTTTTAACTTTTTTATTTGTA 33 00 

I I I I M I II II II I II I I | || II II II I II II II I II II II I II I I | | M II I I II I I II 
Db 32 41 TTAGATGAAACTGTTACCTTACCCTAAACACAGTATTTCTTTTTAACTTTTTTATTTGTA 33 0 0 

Qy 3 3 01 AACTAATAAAGGTAATCACAGCCACCAACATTCCAAGCTACCCTGGGTACCTTTGTGCAG 3 3 6 0 

^ ' MMI Mill Mill II M MUM MIMM Mill II Mill II II Mill MIMM 

Db 33 01 AACTAATAAAGGTAATCACAGCCACCAACATTCCAAGCTACCCTGGGTACCTTTGTGCAG 3 3 60 

Qy 3 3 61 TAGAAGCTAGTGAGCATGTGAGCAAGCGGTGTGCACACGGAGACTCATCGTTATAATTTA 34 2 0 

M M II II II I II I I II II I I I I II M II II II I II II I I II I I II I II | || I || || II I 
Db 3 3 61 TAGAAGCTAGTGAGCATGTGAGCAAGCGGTGTGCACACGGAGACTCATCGTTATAATTTA 342 0 

Qy 3421 CTATCTGCCAAGAGTAGAAAGAAAGGCTGGGGATATTTGGGTTGGCTTGGTTTTGATTTT 34 8 0 

^ _ Mill Mill Mill II Mill MIMM Mill Mill Mill II III II III MM II 

Db 3421 CTATGTGCCAAGAGTAGAAAGAAAGGCTGGGGATATTTGGGTTGGCTTGGTTTTGATTTT 34 8 0 
Qy 3481 TTGCTTGTTTGTTTGTTTTGTACTAAAACAGTATTATCTTTTGAATATCGTAGGGACATA 354 0 

I M I II II II II 1 1 1 M II II M II I II M I II I II II I I II 1 1 II M M II II II 1 1 II 

Db 3481 TTGCTTGTTTGTTTGTTTTGTACTAAAACAGTATTATCTTTTGAATATCGTAGGGACATA 354 0 



1 o 



Qy 


3541 


AGTATATACATGTTATCCAATCAAGATGGCTAGAATGGTGCCTTTCTGAGTGTCTAAAAC 


3600 


Db 


3541 


1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AGTATATACATGTTATCCAATCAAGATGGCTAGAATGGTGCCTTTCTGAGTGTCTAAAAC 


3600 


Qy 


3601 


TTGACACCCCTGGTAAATCTTTCAACACACTTCCACTGCCTGCGTAATGAAGTTTTGATT 


3660 


Db 


3601 


1 1 M 1 1 M II 1 1 1 i 1 II 1 M II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 1 1 1 II 1 1 
TTGACACCCCTGGTAAATCTTTCAACACACTTCCACTGCCTGCGTAATGAAGTTTTGATT 


3660 


Qy 


3661 


CATTTTTAACCACTGGAATTTTTCAATGCCGTCATTTTCAGTTAGATGATTTTGCACTTT 


3720 


Db 


3661 


M 1 II II M M II 1 M 1 M 1 M II M M M M 1 1 1 II M 1 1 1 M 1 1 1 i 1 1 M M M 1 1 M 

CATTTTTAACCACTGGAATTTTTCAATGCCGTCATTTTCAGTTAGATGATTTTGCACTTT 


3720 


Qy 


3721 


GAGATTAAAATGCCATGTCTATTTGATTAGTCTTATTTTTTTATTTTTACAGGCTTATCA 


3780 


Db 


3721 


MUM ! 1 1 1 J 1 1 1 1 1 J 1 1 1 1 1 1 J 1 MINI MM II III II II III II III II UNI 1 

GAGATTAAAATGCCATGTCTATTTGATTAGTCTTATTTTTTTATTTTTACAGGCTTATCA 


3780 


Qy 


3781 


GTCTCACTGTTGGCTGTCATTGTGACAAAGTCAAATAAACCCCCAAGGACGACACACAGT 


3840 


Db 


3781 


II MM II II 1 II II 1 II II M 1 II II MM MM II Ml MM II MM II II 1 II II 1 

GTGTCACTGTTGGCTGTCATTGTGACAAAGTCAAATAAACCCCCAAGGACGACACACAGT 


3840 


Qy 


3841 


ATGGATCACATATTGTTTGACATTAAGCTTTTGCCAGAAAATGTTGCATGTGTTTTACCT 


3900 


Db 


3841 


1 II 1 II II II M 1 1 1 1 II II 1 1 II 1 1 1 III M II 1 1 1 II 1 II 1 1 M 1 1 M II II 1 II III 
ATGGATCACATATTGTTTGACATTAAGCTTTTGCCAGAAAATGTTGCATGTGTTTTACCT 


3900 


Qy 


3901 


CGACTTGCTAAAATCGATTAGCAGAAAGGCATGGCTAATAATGTTGGTGGTGAAAATAAA 


3960 


Db 


3901 


1 1 MINI MMIIMMII Mill Mill Mill III MM III MIMMMMMII 

CGACTTGCTAAAATCGATTAGCAGAAAGGCATGGCTAATAATGTTGGTGGTGAAAATAAA 


3960 


Qy 


3961 


TAAATAAGTAAACAAAATGAAGATTGCGTGCTCTCTCTGTGCCTAGCCTCAAAGCGTTCA 


4020 


Db 


3961 


1 M Ml M M M Ml 1 II II Mill Mill II Mill MMMM III II II II 1 II III 

TAAATAAGTAAACAAAATGAAGATTGCCTGCTCTCTCTGTGCCTAGCCTCAAAGCGTTCA 


4020 


Qy 


4021 


TCATACATCATACCTTTAAGATTGCTATATTTTGGGTTATTTTCTTGACAGGAGAAAAAG 


4080 


Db 


4021 


IMIMI MM M MMMM MIMM II Mill M MMMM Mill MM MINI 

TCATACATCATACCTTTAAGATTGCTATATTTTGGGTTATTTTCTTGACAGGAGAAAAAG 


4080 


Qy 


4081 


ATCTAAAGATCTTTTATTTTCATCTTTTTTGGTTTTCTTGGCATGACTAAGAAGCTTAAA 


4140 


Db 


4 081 


IMIMIMM IMIMI MIMM MINI MM MMMMM MM MIMM MM 

ATCTAAAGATCTTTTATTTTCATCTTTTTTGGTTTTCTTGGCATGACTAAGAAGCTTAAA 


4140 


Qy 


4141 


TGTTGATAAAATATGACTAGTTTTGAATTTACACCAAGAACTTCTCAATAAAAGAAAATC 

M 1 1 1 1 1 1 1 1 1 Mill IIMIIIIIIIMIMMIMI II Mill Mill MMMMM 

TGTTGATAAAATATGACTAGTTTTGAATTTACACCAAGAACTTCTCAATAAAAGAAAATC 


4200 


Db 


4141 


4200 


Qy 


4201 


ATGAATGCTCCACAATTTCAACATACCACAAGAGAAGTTAATTTCTTAACATTGTGTTCT 


4260 


Db 


4201 


II Mill II II MMMMM III II Mill II Ml Mill MM MMMM II Mill 

ATGAATGCTCGACAATTTCAACATACCACAAGAGAAGTTAATTTCTTAACATTGTGTTCT 


4260 


Ov 


4261 


ATGATTATTTGTAAGAPrTTPAGGAAGTTCTGATATPTTTTAAArJAPATAnTTr'A A A ATT 


4320 


Db 


4261 


IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIII 

ATGATTATTTGTAAGACCTTCACCAAGTTCTGATATCTTTTAAAGACATAGTTCAAAATT 


4320 


Qy 


4321 


GCTTTTGAAAATCTGTATTCTTGAAAATATCCTTGTTGTGTATTAGGTTTTTAAATACCA 


4380 


Db 


4321 


M M M II M 1 M 1 M II M II M II 1 1 II M II II II 1 1 II II II II 1 II M M II II 1 

GCTTTTGAAAATCTGTATTCTTGAAAATATCCTTGTTGTGTATTAGGTTTTTAAATACCA 


4380 


Qy 


4381 


GCTAAAGGATTACCTCACTGAGTCATCAGTACCCTCCTATTCAGCTCCCCAAGATGATGT 


4440 


Db 


4381 


IMMMMMIMM MMMMM II Mill MIMMMMMII M MMMMM 

GCTAAAGGATTACCTCACTGAGTCATCAGTACCCTGCTATTCAGCTCCCCAAGATGATGT 


4440 



Qy 


4441 


GTTTTTGCTTACCCTAAGAGAGGTTTTCTTCTTATTTTTAGATAATTCAAGTGCTTAGAT 


4500 


Db 


4441 


1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GTTTTTGCTTACCCTAAGAGAGGTTTTCTTCTTATTTTTAGATAATTCAAGTGCTTAGAT 


4500 


Qy 


4501 


AAATTATGTTTTCTTTAAGTGTTTATGGTAAACTCTTTTAAAGAAAATTTAATATGTTAT 


4560 


Db 


4501 


MIIIMIIIIIIIIIIIIIMIIIMIIMIIIIMIIIIIIIIMIIIIIMIIIIII 

AAATTATGTTTTCTTTAAGTGTTTATGGTAAACTCTTTTAAAGAAAATTTAATATGTTAT 


4560 


Qy 


4561 


AGCTGAATCTTTTTGGTAACTTTAAATCTTTATCATAGACTCTGTACATATGTTCAAATT 


4620 


Db 


4561 


II M 1 1 M M M II M II M M 1 M M i M II 1 1 1 M II 1 M 1 1 II 1 1 II 1 1 II M 1 i M 

AGCTGAATCTTTTTGGTAACTTTAAATCTTTATCATAGACTCTGTACATATGTTCAAATT 


4620 


Qy 


4621 


AGCTGCTTGCCTGATGTGTGTATCATCGGTGGGATGACAGAACAAACATATTTATGATCA 


4680 


Db 


4621 


1 1 1 ! 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 f 1 ! [ I 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 i M M I M M 1 1 M 1 

AGCTGCTTGCCTGATGTGTGTATCATCGGTGGGATGACAGAACAAACATATTTATGATCA 


4680 


Ov 


4681 


TGAATAATGTG CT TT GT AAAAAGAT T T C AAGT T ATT AGGAAG CAT AC T CTGT T TT T T AAT 


A HA n 


Db 


4681 


1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 

TGAATAATGTGCTTTGTAAAAAGATTTCAAGTTATTAGGAAGCATACTCTGTTTTTTAAT 


4740 


Qy 


4741 


CATGTATAATATTCCATGATACTTTTATAGAACAATTCTGGCTTCAGGAAAGTCTAGAAG 


4800 


Db 


4741 


IMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIII 

CATGTATAATATT CCATGATACTT TT AT AGAAC AAT TCTGGCTT C AGGAAAGT CT AGAAG 


4800 


Qy 


4801 


CAATATTTCTTCAAATAAAAGGTGTTTAAACTTT 4 834 




Db 


4801 


IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIM 

CAATATTTCTTCAAATAAAAGGTGTTTAAACTTT 4 834 





o 
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Emerson et al • / SEQ ID NO s 3 65 

RESULT 2 
AAD03774 

ID AAD03774 Standard; cDNA; 4834 BP. 
XX 

AC AAD03 774 ; 
XX 

DT 19-JUN-2001 (first entry) 
XX 

DE Human sulfatase (HSulf-1) AB029000 cDNA 
XX 
KW 
KW 
KW 

KW ss 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..4833 
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aa 


:Xaa) 


(pos 


:2641 


. .2643, 


aa 


:Xaa) 


(pos 


:2662 


. .2664, 


aa 


:Xaa) 


(pos 


:2686 


. . 2688, 


aa 


:Xaa) 


(pos 


:2719 


.2721, 


aa 


:Xaa) 


(pos 


:2728 


.2730, 


aa 


:Xaa) 


(pos 


:2743 


.2745, 


aa 


:Xaa) 


(pos 


:2779. 


. 2781, 


aa 


:Xaa) 


(pos 


:2827. 


.2829, 


aa 


:Xaa) 


(pos 


:2833 . 


.2835, 


aa 


:Xaa) 


(pos 


2842 . 


. 2844, 


aa 


:Xaa) 


(pos 


2860 . 


. 2862, 


aa 


:Xaa) 


(pos 


2875. 


. 2877, 


aa 


:Xaa) 


(pos 


2911. 


.2913, 


aa 


:Xaa) 


(pos 


2920. 


. 2922, 


aa 


Xaa) 


(pos : 


3052 . 


. 3054, 


aa 


Xaa) 


(pos : 


3136. 


. 3138, 


aa 


Xaa) 


(pos : 


3238 . 


. 3240, 


aa , 


Xaa) 


(pos : 


3265. 


. 3267, 


aa : 


Xaa) 


(pos : 


3304 . 


. 3306, 


aa: 


Xaa) 


(pos : 


3307. 


.3309, 


aa ; 


Xaa) 


(pos : 


3313 . 


.3315, 


aa: 


Xaa) 


(pos : 


3361. 


. 3363, 


aa: 


Xaa) 


(pos : 


3379 . 


. 3381, 


aa : 


Xaa) 


(pos : 


3571 . 


. 3573, 


aa : 


Xaa) 


(pos : 


3595 . 


.3597, 


aa : 


Xaa) 


(pos : 


3667. 


.3669, 


aa : 


Xaa) 


(pos : 


3703 . 


.3705, 


aa : 


Xaa) 


(pos : 


3748 . 


.3750, 


aa : 


Xaa) 


(pos : 


3919. 


.3921, 


aa : 


Xaa) 


(pos : 


3961 . 


.3963, 


aa : 


Xaa) 



FT 


/transl 


excepts 


(pos 


:4066 . 


.4068, 


aa 


:Xaa) 


FT 


/ transl 


except^ 


(pos 


:4084 . 


.4086, 


aa 


:Xaa) 


FT 


/ transl 


except= 


(pos 


:4144 . 


.4146, 


aa 


:Xaa) 


FT 


/transl_ 


_except= 


(pos 


:4147 . 


.4149, 


aa 


:Xaa) 


FT 


/ transl 


except= 


(pos 


:4189 . 


.4191, 


aa 


:Xaa) 


FT 


/transl 


except^ 


(pos 


:4291 . 


.4293, 


aa 


:Xaa) 


FT 


/ transl_ 


except= 


(pos 


:4309 . 


.4311, 


aa 


:Xaa) 


FT 


/transl_ 


_except= 


(pos 


■4342 . 


.4344, 


aa 


:Xaa) 


FT 


/ transl 


except= 


(pos 


:4372 . 


.4374, 


aa 


:Xaa) 


FT 


/transl 


_except= 


(pos 


.4399 . 


.4401, 


aa 


:Xaa) 


FT 


/transl_ 


_except= 


(pos 


4435 . 


.4437, 


aa 


:Xaa) 


FT 


/transl_ 


except= 


(pos 


:4483 . 


.4485, 


aa 


:Xaa) 


FT 


/transl_ 


except= 


(pos 


4516 . 


.4518, 


aa 


:Xaa) 


FT 


/transl 


_except= 


(pos 


4564 . 


.4566, 


aa 


:Xaa) 


FT 


/transl 


except^ 


(pos 


4675 . 


.4677, 


aa 


:Xaa) 


FT 


/transl 


_except= 


(pos 


4681 . 


.4683, 


aa 


:Xaa) 


FT 


/transl 


except= 


(pos 


4647 . 


.4749, 


aa 


:Xaa) 


FT 


/transl 


except= 


(pos 


4768 . 


.4770, 


aa 


:Xaa) 


FT 


/ trans 1_ 


_except= 


(pos 


4795 . 


,4797, 


aa 


:Xaa) 


FT 


/transl_ 


_except= 


(pos- 


4816 . 


.4818, 


aa 


:Xaa) 



FT /note= "Xaa corresponds to inframe stop codon; CDS 

FT does not include start and stop codons " 

FT /partial 
XX 

PN WO200121640-A1 . 
XX 

PD 29-MAR-2001. 
XX 

PF 22-SEP-2000; 2000WO-US26124 . 
XX 

PR 23-SEP-1999; 99US- 015573 8 . 
XX 

PA (UYPE-) UNIV PENNSYLVANIA. 

PA (ROYA-) ROYAL VETERINARY COLLEGE. 

XX 

PI Emerson CP, Dhoot GK; 
XX 

DR WPI; 2001-266062/27. 

DR P-PSDB; AAE00438. 
XX 

PT Novel Sulf-1 or Sulf-2 (members of subfamily of sulfatases) polypeptide 

PT useful for treating musculoskeletal, neural or renal degenerative 

PT disorder, and for inhibiting viral infection of cells 
XX 

PS Claim 2/ Page 40-42; 59pp; English. 
XX 

CC The present cDNA sequence encodes human sulfatase (HSulf-1) which is 

CC obtained from EST (Expressed Sequence Tag) AB029000. 

CC The invention relates to Sulf-1 and Sulf-2 proteins and their 

CC corresponding cDNA molecules which are the members of subfamily of 

CC sulfatases. These sulfatase proteins are expressed in neural and muscle 

CC lineages in various species. Sulfatase proteins are useful for modifying 

CC growth properties of cells, preferably cancer cells, useful in the 

CC treatment of cancer and in the inhibition of metastases. Sulf-1 and 

CC Sulf-2 are useful in developing cells for transplant in the treatment of 

CC skeletomuscular degenerative diseases, neurodegenerative diseases, renal 

CC degenerative diseases and in initiation growth of healthy cells and to 

CC heal diseased cells in these disorders. Sulfatases are also useful for 

CC inhibiting infection of cells by viruses which utilise sulfated heparin 

CC proteoglycans for entry into cells, and for modulating recruitment of 

CC lymphocytes by cells to sites of inflammation. A functional embryonic 



CC technique is useful to functionally characterise members of Sulf-1 and 

CC Sulf-2 sulfatase gene subfamily, which is efficient and economical. 
XX 

SQ Sequence 4834 BP; 1498 A; 992 C; 1042 G; 1302 T; 0 other; 



Query Match 100.0%; Score 4834/ DB 22; Length 4834/ 

Best Local Similarity 100.0%; Pred, No. 0; 



Matches 


4 834; Conservative 0; Mismatches 0; Indels 0; Gaps 


Ov 


i 


GATGTGGAGPTGGGGTCPPTGCAAGTCATGAAPAAAAPGAGAAAGATTATGG 

vjii x Vvj x vJnjXj-vj jl vjvjvjvj j_ > — • x \jv.rvri\j jl n^jti x \jm^c\ virion v—- vjJ^\^x^/TjT.vJJxi x x jri x ^Jvj^xriv -xii x \jvjrs_jr 


D VJ 


Db 


i 


1 1 i i i 1 1 1 1 1 ; 1 1 1 1 , 1 1 1 ; i f 1 1 1 [ 1 1 i 1 1 1 1 1 ' 1 1 1 1 < i ; 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 

GATGTGGAGCTGGGGT CCCTGC AAGT CATC AACAAAACGAGA AAGATTATGGAAPATGGG 


6 0 


Ov 


61 


GGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCC 


120 


Db 


61 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIMIIIIII 

GGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCC 


12 0 


Ov 


121 


ATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAAPTOrTrT 


18 0 


Db 


121 


II II II III II I INI III II II III Mill II II II MM II II III II III II II Ml 

ATGCTCACCGGGAAGTATGTGCACAATCArAATGTCTACACCAACAACGAGAArTC^CTPT 


18 0 


Ov 


181 


TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACT 


24 0 


Db 


181 


MM Ml II II Ml III MINIMI Mill Mill Mill MM III III MM Mill 

TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACT 


24 0 


Ov 


241 


GGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCC 


3 0 0 


Db 


241 


II II III Mill II II III II III II Mill Mill II III II II III II III II Mill 

GGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCC 


3 0 0 


Ov 


301 


C CT GGGT GG CGAGAATGG C TT GGAT T AAT P AAGAAT T P T P G CT T CT AT A A T T A C A C* T G TT 


3 60 


Db 


301 


i i f i i i i i i i f i j i i i i i i f i i i i i i i r i i ; i i i i i f j i I i i i r i j i i i i i r f f j i i i i i 

CCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTPTATAATTA CA PTGTT 


3 6 0 


Ov 


3 61 


TGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATT ATGGAA AG^^A^'TAPTTPA^ , A^iA^ , 

J- vj J. vJVwtifi J- vjjvjVwJTi a v^jrijr^jrVvjjr^jriJrijri\j x \ X JL X vJ/l X x X \jv^jr^JrifT.vjfVjjrl\ . X Jri.v_* X X v^i-i. \ .r-i. vxr~i 




Db 


3 61 


1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 II 1 II 1 1 II 1 II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 

TGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGGAAAGGAGTAPTTGAr'Zi.GAP 


42 0 


Ov 
vy 


421 


TT7VATPAPTAAPGAGAPJ-PATTA ATTAPTTPA AA ATGTPTA AGAPtAATPiTA r T 1 PPPPZ\TZ\nn 


Aon 
tow 


Db 


421 


MMMIIIMIMMIMMIMIIIMMIIIIIMIMMIMMIMIIIMMM 

TTAATCACTAACGAGAGPATTAATTAPTTPAAAATGTPTAAGAGAATGTATPPPPATAGG 


TO U 


Qy 


481 


CCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG 


540 


Db 


481 


II M III Mill II III MM Mill III II Hill MUM MM MM III III MM 

CCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG 


540 


Qy 


541 


TTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCA 


600 


Db 


541 


M 1 M II 1 1 M 1 1 II II M 1 M 1 1 1 II 1 M 1 M 1 1 1 M M 1 1 1 1 II 1 1 1 M 1 M 1 1 II 1 1 

TTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCA 


600 


Qy 


601 


AATATGGATAAACACTGGATTATGCAGTACACAGGACCAATGCTGCCCATCCACATGGAA 


660 


Db 


601 


IIMIIMIMIMIMIIIIIIIIIIMIMIIIMIMMMIIIIMMIMIIMI 

AATATGGATAAACACTGGATTATGCAGTACACAGGACCAATGCTGCCCATCCACATGGAA 


660 


Qy 


661 


TTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTG 


720 


Db 


661 


i f J 1 1 1 r 1 1 1 1 r i f f r 1 1 1 1 1 1 f i f 1 1 1 1 j i r i i r f 1 1 1 1 1 f 1 1 1 1 1 1 1 1 f 1 1 1 1 1 r 1 1 1 

TTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTG 


720 


Qy 


721 


GAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTAC 


780 


Db 


721 


II III Ml II II Ml II III II II III II III MINI MM II II III Mill II MM 

GAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTAC 


780 



3 O 

Qy 781 ACCGCCGACCATGGTTACCA^ 84Q 

Db 781 ACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAG^ 84 0 

Qy 841 JATGACTTTGATATTCGTGTGCCT^ 9Q0 

Db 841 TATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCA^ 900 

Qy 901 ATAGTCCCACAGATC^ ggo 

Db 901 ATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATC 960 

Qy 961 CTCGACAC^CCTCCTGATGTGGAro 102Q 

Db 961 CTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCTC i 020 

Qy 1021 < j^' J ^" < ^ C ^ , ^ J ^^^*J^'^ < ^*^ , ^ '^1^''^* *^ < ^"^ - ^^ < j^"^^~^^^*j^"^^^ < ^'^^~^^^^^^ *~ C^-^"^^?^?^.'!? "X"ZC*C3C3- C3'X"<3 J ?^ri7^VCI!L?^*I7 T T-^^-OT C3- 1080 

Db 1021 CCAGGTAACAGGTTTCGAACAAACAAGAAGGCCA 1080 

Qy 1081 GAAAOAGGC^^ 114Q 

Db 1081 GAAAGAGGCAAAT TTCTACGTAAGAAGGAAGAATC CAG C AAGAATAT C CAACAGT C AAAT 114 0 

Qy 1141 CACTTGCCCAAATATGAACGGGT 1200 

Db 1141 CACTTGCCCAAATATGAACGGGTCAAAGAACTATGCCAGCAGGCCAG^ 1200 

Qy 1201 



Db 1201 TGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATA 1260 

Db 1261 CACAAGTGTAAAGGACCCAGTGACCTGcicACAGTC^ 1320 

Qy 1321 GCTCGCGGCTTCCATGACAAAGAC^ 138Q 

GCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGTTOTAGGGAGTC 



1380 



Qy 13 81 AG(^GAAGCCAAAGAAAGAGTCA^ 144Q 

Db 13 81 AGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAAC^ 1440 

Qy 1441 ^GCCCAGATTTGTCCATACTCGG^ 150Q 

Db 1441 AAGCCCAGATTTGTCCATACTCGGCAGACACGTTC^ 1500 

Qy 1501 ATATATGACATAAATCTGGAAGAA 1560 

Db 1501 ATATATGACATAAATCTGGAAGAAGAAGAAGAATTGCAAGTGTTGCA^ 1 560 

Qy 1561 ^^y^^^^^y ^^^^^y 1 ^"^^^^^^^^"^^^^^^^ CAAGAGA.TCTCCAGGJCTTCCAGTGGTGGC 1620 

Db 1561 GCTAAGCGTCATGATGAAGGCCACAAGGGGCCAAGAGATCTCCAGGCT 1620 

Qy 1621 ^CAGGGGCAGGATGCTGGCAGATAG^ 168Q 

Db 1621 AACAGGGGCAGGATGCTGGCAGATAGCAGCAACGCC^ 1680 



Qy 1681 GTGACACAC^GTGTTTTATTC 174Q 

Db 1681 GTGACACACAAGTGTTTTATTCTTCCCAATGACTCT i 740 

Qy 1741 CAATCGGCCAGAGCGTGGA^ 1800 

Db 1741 CAATCGGCCAGAGCGTGGAAGGACCATAAGGCATACATTGAC 1800 

Qy 18 01 < j--^-^ < ^ j ^-'y'-^^^^-T TA^^GA.A.T T T G^iG^GG^^CA.T C TGA-AG AGJVAGGAJ^G C CTGrAG I860 



Db 1801 CAAGATAAAATTAAGAATTTAAGAGAAGTGAGAGGACATC^ 1860 

Db 18 61 GAATGTAGCTGCAGTAAACAAAGCTATTACAA^ 1920 

Qy 1921 ^TTAAAGAGCCATCTTCACCCATT^ 1980 

Db 1921 AAATTAAAGAGCCATCTTCACCCATTCAAGGAGGCTGCTCAGG 1980 

Qy 1981 CAACTTTTCAAGGAGAACAACCGTAGG^ 2040 

Db 1981 CAACTTTTCAAGGAGAACAACCGTAGGAGGAAGAAGGAGAGGAA 2040 

Qy 2 041 AGGAAGGGGGAAGAGTGCAGCCTGCCTGGCCTCACTT^ 2100 

Db 2 041 AGGAAGGGGGAAGAGTGCAGCCTGCCTGGCCTCACTTGCT^ 2100 

Qy 2101 TGGCAGACAGCCCCGTTCTGGAACCTGGGATCTTTCTGTG 2160 

Db 2101 TGGCAGACAGCCCCGTTCTGGAACCTGGGATCTTTCTGTC 2160 

Qy 2161 ^CACCTACTGGTGTTTGCGTACAGTTAATGAGACGCATAATTTTC 2 220 

Db 2161 AACACCTACTGGTGTTTGCGTACAGTTAATGAGACGCATAATTOT 2 220 

Qy 2221 GCTACTGGCTTTTTGGAGTATTTT^ 2280 

Db 2221 gctactggctttttggagtattttgatatgaaUcagatccttatcagot 



2280 

Db 2281 GTGCACACGGTAGAArfiAP,flr'I'i'i'iJ'i-iirij<lij,liiJ / lilJ|J J HJ jliiJ ill [J I I 
Qy 2341 



AGCTGTCAAGGATM 2400 

Db 2341 AGCTGTCAAGGATATAAGCAGTGCAACCCAAGACCTAAGAA 2400 

Qy 2401 GATGGAGGAAGCTAT^ 2460 

Db 2401 GATGGAGGAAGCTATGACCTACACAGAGGACAGTTATGGGATC 24 60 

Qy 2461 GCCCCGTCTCACTGCAGACATCAACTGGC^^ 2520 

Db 2461 GCCCCGTCTCACTGCAGACATCAACTGGCAAGGCCTAGAGGA 25 20 

Qy 2521 ^CATCTATGAGTACA^ 2580 

Db 2521 AAACATCTATGAGTACAGACAAAACTACAGACT 2580 



1- 








Qy 


2581 


GAAGGATTTAGATAGAGTATTTGCACTGCTGAAGAGTCACTATGAGCAAAATAAAACAAA 


2640 


Db 


2581 


IIIIMIIIIIMIIMIIIIIIIMMIIIIIIMIIIIIIMIMIIIIIIIIMIM 

GAAGGATTTAGATAGAGTATTTGCACTGCTGAAGAGTCACTATGAGCAAAATAAAACAAA 


2640 


Qy 


2641 


TAAGACTCAAACTGCTCAAAGTGACGGGTTCTTGGTTGTCTCT?GCTGAGCACGCTGTGTC 


2700 


Db 


2641 


llllllllllilllllllllllllllllllllllllllllllllllllllllllllllll 

TAAGACTCAAACTGCTCAAAGTGACGGGTTCTTGGTTGTCTCTGCTGAGCACGCTGTGTC 


2700 


Qy 


2701 


AATGGAGATGGCCTCTGCTGACTCAGATGAAGACCCAAGGCATAAGGTTGGGAAAACACC 


2760 


Db 


2701 


1 M 1 1 M 1 M 1 II M 1 1 II 1 1 1 1 1 1 1 M 1 M 1 II M 1 1 1 1 1 1 1 1 M 1 II II 1 1 1 M M II 

AATGGAGATGGCCTCTGCTGACTCAGATGAAGACCCAAGGCATAAGGTTGGGAAAACACC 


2760 


Qy 


2761 


TCATTTGACCTTGCCAGCTGACCTTCAAACCCTGCATTTGAACCGACCAACATTAAGTCC 


2820 


Db 


2761 


IIIIIIIIIIIIIIIIIIIIMMMIIMMIMMMMMIIMMIIIIIMIIII 

TCATTTGACCTTGCCAGCTGACCTTCAAACCCTGCATTTGAACCGACCAACATTAAGTCC 


2820 


Qy 


2821 


AGAGAGTAAACTTGAATGGAATAACGACATTCCAGAAGTTAATCATTTGAATTCTGAACA 


2880 


Db 


2821 


IIIIIIIMIIMIIIIMMIMIIIIMIIIMIIIIIIIIMIIMIIIIIIIIMI 

AGAGAGTAAACTTGAATGGAATAACGACATTCCAGAAGTTAATCATTTGAATTCTGAACA 


2880 


Qy 


2881 


CTGGAGAAAAACCGAAAAATGGACGGGGCATGAAGAGACTAATCATCTGGAAACCGATTT 


2940 


Db 


2881 


IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIII 

CTGGAGAAAAACCGAAAAATGGACGGGGCATGAAGAGAGTAATGATCTGGAAACCGATTT 


2940 


Qy 


2941 


CAGTGGCGATGGCATGACAGAGCTAGAGCTCGGGCCCAGCCCCAGGCTGCAGCCCATTCG 


3000 


Db 


2941 


MMIIIIIIIIIIIIIIIMIIIIIIIIIIMIMIIIIIIIIIIIIMIMIIIMII 

CAGTGGCGATGGCATGACAGAGCTAGAGCTCGGGCCCAGCCCCAGGCTGCAGCCCATTCG 


3000 


Qy 


3001 


CAGGCACCCGAAAGAACTTCCCCAGTATGGTGGTCCTGGAAAGGACATTTTTGAAGATCA 


3060 


Db 


3001 


1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

CAGGCACCCGAAAGAACTTCCCCAGTATGGTGGTCCTGGAAAGGACATTTTTGAAGATCA 


3060 


Qy 


3061 


ACTATATCTTCCTGTGCATTCCGATGGAATTTCAGTTCATCAGATGTTCACCATGGCCAC 


3120 


Db 


3061 


IIIIMIIIMIIIMIIIIIIMIIIIMIIIIIMIIIIMIIMIIMIMIIMM 

ACTATATCTTCCTGTGCATTCCGATGGAATTTCAGTTCATCAGATGTTCACCATGGCCAC 


3120 


Qy 


3121 


CGCAGAACACCGAAGTAATTCCAGCATAGCGGGGAAGATGTTGACCAAGGTGGAGAAGAA 


3180 


Db 


3121 


IIMIIIIIIIMIIIIIIMMMIMIMIMIMMMIMMIIIIMIMIMII 

CGCAGAACAC CGAAGTAATT C CAGCATAGCGGGGAAGATGTTGAC CAAGGTGGAGAAGAA 


3180 


Qy 


3181 


TCACGAAAAGGAGAAGTCACAGCACCTAGAAGGCAGCGCCTCCTCTTCACTCTCCTCTGA 


3240 


Db 


3181 


M 1 i 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 1 I I I I M 1 I | | | | | | I I | | | | | | | | | | | | M 1 1 
TCACGAAAAGGAGAAGTCACAGCACCTAGAAGGCAGCGCCTCCTCTTCACTCTCCTCTGA 


3240 


Qy 


3241 


TTAGATGAAACTGTTACCTTACCCTAAACACAGTATTTCTTTTTAACTTTTTTATTTGTA 


3300 


Db 


3241 


IIMMMIIIIIIIMIIMIIIMMIIIIIIIMIIIIIIIIIIIIMIIIIlllll 

TTAGATGAAACTGTTACCTTACCCTAAACACAGTATTTCTTTTTAACTTTTTTATTTGTA 


3300 


Qy 


3301 


AACTAATAAAGGTAATCACAGCCACCAACATTCCAAGCTACCCTGGGTACCTTTGTGCAG 


3360 


Db 


3301 


I ( 1 M ! 1 1 1 1 ! 1 1 1 1 1 1 1 ! 1 [ 1 1 1 1 [ 1 1 ! f 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 M 1 1 1 f 1 II 1 1 

AACTAATAAAGGTAATCACAGCCACCAACATTCCAAGCTACCCTGGGTACCTTTGTGCAG 


3360 


Qy 


3361 


TAGAAGCTAGTGAGCATGTGAGCAAGCGGTGTGCACACGGAGACTCATCGTTATAATTTA 
1 1 II 1 1 1 1 II M 1 II 1 M 1 II 1 1 II 1 M 1 I M 1 1 1 II 1 1 I 1 1 II 1 M 1 1 1 1 M 1 1 1 1 1 1 1 


3420 


Db 


3361 


) 1 1 1 I I l i l l | li M | [ M | | M | M II [ M 1 1 M 1 I 1 M 1 1 II 1 1 1 1 M M | | 1 M | M 
TAGAAGCTAGTGAGCATGTGAGCAAGCGGTGTGCACACGGAGACTCATCGTTATAATTTA 


3420 


Qy 


3421 


CTATCTGCCAAGAGTAGAAAGAAAGGCTGGGGATATTTGGGTTGGCTTGGTTTTGATTTT 


3480 


Db 


3421 


IMMIIIimilMIIIMMIMIMIIIMIIIIIMIIIimilllllMIMI 

CTATCTGCCAAGAGTAGAAAGAAAGGCTGGGGATATTTGGGTTGGCTTGGTTTTGATTTT 


3480 











Qy 


3481 


TTGCTTGTTTGTTTGTTTTGTACTAAAACAGTATTATCTTTTGAATATCGTAGGGACATA 


3 54 0 


Db 


3481 


1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 II 1 1 1 1 II 1 II 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II M 1 

TTGCTTGTTTGTTTGTTTTGTACTAAAACAGTATTATCTTTTGAATATCGTAGGGACATA 


3540 


Qy . 


3541 


AGTATATACATGTTATCCAATCAAGATGGCTAGAATGGTGCCTTTCTGAGTGTCTAAAAC 


3600 


Db 


3541 


MMMIIM! MIMM Mill III MINIMI 1 MM IIIMMIMMMMIIM 

AGTATATACATGTTATCCAATCAAGATGGCTAGAATGGTGCCTTTCTGAGTGTCTAAAAC 


3600 


Qy 


3601 


TTGACACCCCTGGTAAATCTTTCAACACACTTCCACTGCCTGCGTAATGAAGTTTTGATT 


3660 


Db 


3601 


M 1 1 1 1 1 1 II 1 1 M II 1 II 1 II II II II 1 M II II M M II II II 1 M 1 II 1 1 II II II 1 

TTGACACCCCTGGTAAATCTTTCAACACACTTCCACTGCCTGCGTAATGAAGTTTTGATT 


3660 


Qy 


3661 


CATTTTTAACCACTGGAATTTTTCAATGCCGTCATTTTCAGTTAGATGATTTTGCACTTT 


3 72 0 


Db 


3661 


II II III II Ml MM II 1 III MM II III Mill Mill MIMM MIMM Ml M 

CATTTTTAACCACTGGAATTTTTCAATGCCGTCATTTTCAGTTAGATGATTTTGCACTTT 


3720 


Qy 


3721 


GAGATTAAAATGCCATGTCTATTTGATTAGTCTTATTTTTTTATTTTTACAGGCTTATCA 


3780 


Db 


3721 


II 1 1 II 1 II II 1 II II 1 II 1 II II M M 1 II II 1 M II M 1 1 1 II 1 II II 1 II II M 1 1 1 
GAGATTAAAATGCCATGTCTATTTGATTAGTCTTATTTTTTTATTTTTACAGGCTTATCA 


3780 


Qy 


3781 


GTCTCACTGTTGGCTGTCATTGTGACAAAGTCAAATAAACCCCCAAGGACGACACACAGT 


3840 


Db 


3781 


II II II 1 II II 1 M II II 1 1 M M II M 1 II M 1 M II 1 M I II II M II 1 II M 1 1 1 II 

GTCTCACTGTTGGCTGTCATTGTGACAAAGTCAAATAAACCCCCAAGGACGACACACAGT 


3840 


Qy 


3841 


ATGGATCACATATTGTTTGACATTAAGCTTTTGCCAGAAAATGTTGCATGTGTTTTACCT 


3900 


Db 


3841 


M II III Mill MIMM Mill MMMIMMIIIMMIMI Mill Mill II II 

ATGGATCACATATTGTTTGACATTAAGCTTTTGCCAGAAAATGTTGCATGTGTTTTACCT 


3900 


Qy 


3901 


CGACTTGCTAAAATCGATTAGCAGAAAGGCATGGCTAATAATGTTGGTGGTGAAAATAAA 


3960 


Db 


3901 


II M M M M II 1 II M II M II II 1 1 M 1 II M II 1 M II M II 1 II 1 1 II 1 II 1 1 1 1 1 

CGACTTGCTAAAATCGATTAGCAGAAAGGCATGGCTAATAATGTTGGTGGTGAAAATAAA 


3960 


Qy 


3961 


TAAATAAGTAAACAAAATGAAGATTGCCTGCTCTCTCTGTGCCTAGCCTCAAAGCGTTCA 


4020 


Db 


3961 


II 1 1 M M M II 1 M II II M M 1 II M 1 II M 1 II 1 II II M 1 II II 1 II M M 1 II 1 1 

TAAATAAGTAAACAAAATGAAGATTGCCTGCTCTCTCTGTGCCTAGCCTCAAAGCGTTCA 


4020 


Qy 


4021 


TCATACATCATACCTTTAAGATTGCTATATTTTGGGTTATTTTCTTGACAGGAGAAAAAG 


4080 


Db 


4021 


M 1 M II 1 M M M M 1 1 M M M 1 M II II 1 II II 1 II 11 1 M II II II 1 1 M II M II 

TCATACATCATACCTTTAAGATTGCTATATTTTGGGTTATTTTCTTGACAGGAGAAAAAG 


4080 


Qy 


4081 


ATCTAAAGATCTTTTATTTTCATCTTTTTTGGTTTTCTTGGCATGACTAAGAAGCTTAAA 


4140 


Db 


4081 


II 1 II II 1 II II II M II 1 II M M II 1 II II II 1 II II II 1 1 1 II II II 1 1 II 1 1 M 1 1 
ATCTAAAGATCTTTTATTTTCATCTTTTTTGGTTTTCTTGGCATGACTAAGAAGCTTAAA 


4140 


Qy 


4141 


TGTTGATAAAATATGACTAGTTTTGAATTTACACCAAGAACTTCTCAATAAAAGAAAATC 


4200 


Db 


4141 


II M 1 M 1 II II 1 II II M II 1 II M 1 II 1 11 II II 1 M II 1 II 1 II II 1 M II 1 M 1 1 1 

TGTTGATAAAATATGACTAGTTTTGAATTTACACCAAGAACTTCTCAATAAAAGAAAATC 


4200 


Qy 


4201 


ATGAATGCTCCACAATTTCAACATACCACAAGAGAAGTTAATTTCTTAACATTGTGTTCT 


4260 


Db 


4201 


II 1 M M 1 M M 1 M 1 M M II II 11 1 II 1 II 1 1 II 1 II 1 II 1 1 II 1 II II 1 II 1 1 II II 

ATGAATGCTCCACAATTTCAACATACCACAAGAGAAGTTAATTTCTTAACATTGTGTTCT 


4260 


Qy 


4261 


ATGATTATTTGTAAGACCTTCACCAAGTTCTGATATCTTTTAAAGACATAGTTCAAAATT 
1 ! 1 I 1 1 1 1 1 1 1 It M S II 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 I 1 1 1 1 I 1 I 1 1 1 1 M 1 i 1 1 


4320 


Db 


4261 


M I M M 1 M I 1 M M M 1 1 1 1 1 II II 1 II 1 II II 1 II II 1 1 1 1 1 II II 1 1 1 II 1 1 1 II 1 

ATGATTATTTGTAAGACCTTCACCAAGTTCTGATATCTTTTAAAGACATAGTTCAAAATT 


4320 


Qy 


4321 


GCTTTTGAAAATCTGTATTCTTGAAAATATCCTTGTTGTGTATTAGGTTTTTAAATACCA 


4380 


Db 


4321 


Ml II MM MM Mill II Ml Mill II III MM III II Ml II Ml II II Ml III 

GCTTTTGAAAATCTGTATTCTTGAAAATATCCTTGTTGTGTATTAGGTTTTTAAATACCA 


4380 



3 3 



Qy 


4381 


^ '^^J^^^^ < ^ < ^ J ^'^ ^ J ^ C J' ^J" ^^^^^^^ < F'^^ e ^ ^^"^^"^^ "^"-^ ^ ^ ^ T *^ *^ T.?VX" T CZ!A.C3- C! "ZC™ 0? C C O^V.?^G^^TCj j ?^*X™ GtT 


4440 


Db 


4381 


GCTAAAGGATTACCTCACTGAGTCATC 


4440 


Qy 


4441 


GTTTTTGCTTATC 


4500 


Db 


4441 


GTTTTTGCTTACCCTAAGAGAGGTTTT 


4500 


Qy 


4501 




4560 


Db 


4501 


AAATTATGTTTTCTTTAAGTGTTTAT 


4560 


Qy 


4561 


AGCTG^TCTTTTTGCT 


4620 


Db 


4561 


AGCTGAATCTTTTTGGTAACTTTAA^ 


4620 


Qy 


4621 


AGCTGCT 


4680 


Db 


4621 


AGCTGCTTGCCTGATGTGTGTATCATCG 


4680 


Qy 


4681 


TG^TAATGTGCTTra 


4740 


Db 


4681 


TGAATAATGTGCTTTGTAAAAAGATTTCAACT 


4740 


Qy 


4741 


^^^^^^'y^'y ^ ^ C^TGATAC T XT TATAGAA C?^J\T T C! T GG C T T G AGGAAAG T C T AGAAG 

N 1 N 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 II ! 1 1 | 1 1 1 1 I 


4800 


Db 


4741 


CATGTATAATATTCCATGATACTTTTATAGAACAATTCTGGCTTCAGGAAAGTC 


4800 


Qy 


4801 


C^TATTTCTTCAAATAAAAGGTGTTTAAACTTT 4 834 




Db 


4801 


CAATATTTCTTCAAATAAAAG^^ 4 83 4 





